Resolving Ambiguities in Toponym Recognition in Cartographic Maps
Abstract
Many methods and programs for automatic text recognition exist today. However, there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule, the text appears in arbitrary spatial positions and in different fonts, sizes, and colors. It can touch and overlap graphic symbols, and its meaning is semantically much more ambiguous than that of standard text. To recognize text in graphic documents, it is first necessary to separate it from linear objects, solids, and symbols and to determine its orientation. Even so, recognition programs nearly always produce errors. In the context of raster-to-vector conversion of graphic documents, text recognition is of special interest because textual information can be used to verify vectorization results (post-processing). In this work, we propose a method that combines OCR-based text recognition in raster-scanned maps with heuristics specially adapted to cartographic data in order to resolve recognition ambiguities using, among other information sources, the spatial relationships between objects. Our goal is to form, in the vector thematic layers, geographically meaningful words correctly attached to the cartographic objects.
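The idea of combining OCR output with spatial evidence can be illustrated with a minimal sketch. It is not the authors' method: the gazetteer contents, the function names, the similarity threshold, and the distance weighting are all assumptions chosen for illustration. An ambiguous OCR string is matched against candidate toponyms by string similarity, and the tie is broken by how close the recognized label lies to the object each candidate names.

```python
from difflib import SequenceMatcher
from math import hypot

# Hypothetical gazetteer: toponym -> (x, y) position of the map object it labels.
GAZETTEER = {"Morelia": (100.0, 100.0), "Morelos": (10.0, 12.0)}


def text_similarity(a, b):
    """Case-insensitive similarity in [0, 1] between two strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def resolve_toponym(ocr_text, ocr_xy, gazetteer=GAZETTEER,
                    min_similarity=0.6, distance_scale=10.0):
    """Return the gazetteer name that best explains an OCR reading.

    Candidates below min_similarity are discarded; the remaining ones are
    ranked by text similarity damped by distance to the labeled object.
    Both parameters are illustrative assumptions.
    """
    best_name, best_score = None, 0.0
    for name, (gx, gy) in gazetteer.items():
        sim = text_similarity(ocr_text, name)
        if sim < min_similarity:
            continue
        distance = hypot(ocr_xy[0] - gx, ocr_xy[1] - gy)
        score = sim / (1.0 + distance / distance_scale)
        if score > best_score:
            best_name, best_score = name, score
    return best_name


# "Morelis" is equally similar to both names; position decides.
print(resolve_toponym("Morelis", (11.0, 12.0)))  # Morelos
```

In this sketch the text score alone cannot separate the two candidates, so the spatial term decides. A real system would draw on further evidence such as font, object type, and label orientation.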
Similar resources
Combining Sources of Evidence to Resolve Ambiguities in Toponym Recognition in Cartographic Maps
Graphical documents such as cartographic maps contain a great variety of textual elements appearing in different spatial positions, in different fonts, sizes, and colors, touching and overlapping graphical symbols. This greatly complicates automatic optical recognition of such textual elements in the process of raster-to-vector conversion of graphical documents. In this work, we propose a metho...
Error Detection and Correction in Toponym Recognition in Cartographic Maps
Many methods and programs for automatic text recognition currently exist. However, there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule, the text appears in arbitrary spatial positions and in different fonts, sizes, and colors. The text can touch and overlap graphic symbols. The text meaning is ...
Toponym recognition in custom-made map titles
The titles of customized topographic maps constitute a specific corpus which is characterized by a very significant number of place names and spelling variations. This paper is about identifying toponyms in these titles. The toponym tracking is based on gazetteers as well as light parsing according to patterns. The method used broadens the definition of the toponym to include the nature of the ...
Point-feature lettering of high cartographic quality: A multi-criteria model with practical implementation
There have been numerous and varied research efforts to automate point-feature label placement (PFLP). It seems that many well-established precepts for point-feature annotation used by human cartographers have been neglected so far. As a consequence, the currently implemented, fully automated solutions limit the expressive power of computer-generated maps. In this paper we present a com...
Recognition of Cartographic
Sushil Bhattacharjee, Gladys Monagan. Institut für Informationssysteme, Swiss Federal Institute of Technology (ETH), ETH-Zentrum, CH-8092 Zurich, Switzerland. ABSTRACT: A hybrid (statistical/structural) approach is presented for scale- and orientation-invariant recognition of multi-component cartographic symbols. A decision-tree classifier (DTC) is used to identify the shapes of the individual compon...
Publication date: 2003